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1. Introduction 



The universality of random matrices can be roughly divided into the bulk universality in the interior of the 
spectrum and the edge universality near the spectral edge. Over the past two decades, spectacular progress on 
bulk and edge universality has been made for invariant ensembles, see e.g. [3J \5\ [HI [22] and [2121111 for a review. 
For non-invariant ensembles with i.i.d. matrix elements (Standard Wigner ensembles), edge universality can 
be proved via the moment method and its various generalizations; see e.g. [331 Hil HI] ■ I n order to establish 
bulk universality, a new approach was developed in a series of papers [HI [TOJ [TT1 [T2J Q2J [TSJ HH [T7] based on 
three basic ingredients: (1) A local semicircle law - a precise estimate of the local eigenvalue density down 
to energy scales containing around N e eigenvalues. (2) The eigenvalue distribution of Gaussian divisible 
ensembles via an estimate on the rate of decay to local equilibrium of the Dyson Brownian motion [7]. 
(3) A density argument which shows that for any probability distribution there exists a Gaussian divisible 
distribution with identical eigenvalue statistics down to scales 1/N. In [T7], edge universality is established 
as a corollary of this approach. It asserts that, near the spectral edge, the eigenvalue distributions of two 
generalized Wigner ensembles are the same provided the first two moments of the two ensembles match. 

Another approach to both bulk and edge universality was developed in [26l [27l [20] . Using this approach, 
the authors show that the eigenvalue distributions of two standard Wigner ensembles are the same in the 
bulk, provided that the first four moments match. They also prove a similar result at the edge, assuming 
that the first two moments match and the third moments vanish. 

In this paper, partly based on the approach of |17, . we extend edge universality to eigenvectors associated 
with eigenvalues near the spectral edge, assuming the matching of the first two moments of the matrix 
entries. We prove that, under the same two-moment condition as in |17j . the edge eigenvectors of Hcrmitian 
and symmetric Wigner matrices have the same joint distribution as those of the corresponding Gaussian 
ensembles. The joint distribution of the eigenvectors of Gaussian ensembles is well known and can be easily 
computed. More generally, we prove that near the spectral edge the joint eigenvector-eigenvalue distributions 
of two generalized Wigner matrix ensembles coincide provided that the first two moments of the ensembles 
match and one of the ensembles satisfies a level repulsion condition. 

We also prove similar results in the bulk, under the stronger assumption that the first four moments 
of the two ensembles match. In particular, we extend the result of [26] to cover the universality of bulk 
eigenvectors. 

1.1. Setup. We now introduce the basic setup and notations. Let H" = H = (/ijj)ij=i be an N x N 
Hcrmitian or symmetric matrix whose upper-triangular matrix elements hij = hji, i ^ j, arc independent 
random variables with law Uij having mean zero and variance afj : 

Ehi 3 - = 0, a^:=K\hij\ a . (1.1) 

The law Vij and its variance afj may depend on N, but we omit this fact in the notation. We denote by 
B := (ofj)fj =1 the matrix of the variances. We shall always make the following three assumptions on H. 

(A) For any fixed j we have 

i=l 

Thus B is symmetric and doubly stochastic and, in particular, satisfies — 1 ^ B ^ 1. 
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(B) There exists constants <5_ > and 5+ > 0, independent of N, such that 1 is a simple eigenvalue of B 
and 

spec(.B) c [-1 + <5_,1 -6 + ] U{1}. 

(C) There exists a constant Co, independent of A, such that a 2 j ^ CoA -1 for all i,j = l,...,N. 

Examples of matrices satisfying Assumptions (A) - (C) include Wigner matrices, Wigncr matrices whose 
diagonal elements are set to zero, generalized Wigner matrices, and band matrices whose band width is of 
order cA for some c > 0. See [T7], Section 2, for more details on these examples. 

In our normalization, the matrix entries hij have a typical variance of order A" 1 . It is well known that 
in this normalization the empirical eigenvalue density converges to the Wigner semicircle law g sc (E) dE with 
density 

Qsc(E) := ^-^{4-E 2 )+ for BeM. (1.3) 

In particular, the spectral edge is located at ±2. We denote the ordered eigenvalues of H by Ai ^ . . . ^ Ajv, 
and their associated eigenvectors by u 1; . . . , Ujy- The eigenvectors are £ 2 -normalized. We use the notation 
u Q = (u a (i))iL 1 for the components of the vector u Q . 

Our analysis relies on a notion of high probability which involves logarithmic factors of A. The following 
definitions introduce convenient shorthands. 

Definition 1.1. We set L = L N := A log log A for some fixed A as well as ip = tp N := (log A) 108108 ^. 

Definition 1.2. We say that an A-dependent event f2 holds with high probability if P(f2) ^ 1 — e~ v<= for 
large enough A and some c > independent of A. 

A key assumption for our result is the following level repulsion condition, which is in particular satisfied 
by the Gaussian ensembles (sec Remark 11.51 below) . Consider a spectral window whose size is much smaller 
than the typical eigenvalue separation. Roughly, the level repulsion condition says that the probability of 
finding more than one eigenvalue in this window is much smaller than the probability of finding precisely 
one eigenvalue. In order to state the level repulsion condition, we introduce the following counting function. 
For any E\ ^ Ei we denote the number of eigenvalues in [E\, E2] by 

N(E U E 2 ) := #{j : E± < Xj < E 2 } . 

Definition 1.3 (Level repulsion at the edge). The ensemble H is said to satisfy level repulsion at the edge 
if, for any C > 0, there is an cto > such that the following holds. For any a satisfying < a ^ ao there 
exists a S > such that 

P(V(£- N- 2/3 - a ,E + N- 2/3 - a ) ^ 2) ^ N-°- s (1.4) 
for all E satisfying \E + 2| s$ N- 2 / 3 (p c . 

Definition 1.4 (Level repulsion in the bulk). The ensemble H is said to satisfy level repulsion in the bulk 
if, for any n > 0, there is an a > such that the following holds. For any a satisfying < a ^ a there 
exists a 5 > such that 

P(Af(E-N- 1 - a ,E + N~ 1 - a )^2) ^ N-"- 5 (1.5) 

for all E G [-2 + k,2- k]. 
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Remark 1.5. Both the Gaussian Unitary Ensemble (GUE) and the Gaussian Orthogonal Ensemble (GOE) 
satisfy level repulsion in sense of Definitions 11.31 and 11.41 This can be established for instance as follows; 
see [T|, Sections 3.5 and 3.7, and in particular Lemmas 3.5.1 and 3.7.2, for full details. For GUE and 
GOE, the correlation functions can be explicitly expressed in terms of Hermite polynomials. Using Laplace's 
method, one may then derive the large-TV asymptotics of the correlation functions, from which (|1.4p and 
(|1.5[) immediately follow. (Note that in [I], the exponent of N in the error estimates was not tracked in 
order to simplify the presentation.) 

In the more general case of Wigner matrices, level repulsion in the bulk, (jl.5[) . was proved for matrices 
with smooth distributions in |llj and without a smoothness assumption in |26j . 

We shall use the level repulsion condition of Definition 11.31 to estimate the probability of finding two 
eigenvalues closer to each other than the typical eigenvalue separation. For definiteness, we formulate this 
estimate at the lower spectral edge —2. By partitioning the interval 

-2 - N- 2/3 tp c , -2 + N~ 2/3 tp c 

into 0((p c N a ) subintervals of size N~ 2 / 3 ~ a , we get from (| 1 .4[) that for any sufficiently small a there exists 
a 6 > such that 

P (there exists E with \E + 2| ^ N- 2/3 tp c such that Af(E - N- 2/3 - a ,E + TV- 2 / 3 -") ^ 2) < N~ 5 . 

(1.6) 

A similar result can be derived in the bulk using (jl.5p . 

1.2. Results. Before stating our main results, we recall the definition of the classical eigenvalue locations. 
Let 

n sc (E) := / g sc (x)dx (1.7) 
J — 00 

be the integrated distribution function of the semicircle law. We use 7 Q = Y a ,jv to denote the classical 
location of the a-th eigenvalue under the semicircle law, defined through 

™ SC (7a) = "^r. (1-8) 

To avoid unnecessary technicalities in the presentation, we shall assume that the entries hij of H have 
uniform sub-exponential decay, i.e. 

F(\hij\ ^ xaij) iT 1 cxp(-V) (1.9) 

1) > is some fixed constant. As observed in [S], Section 7, one may easily check that all of our results hold 
provided the subexponential condition (|1.9[) is replaced with the weaker assumption that there is a constant 
C such that 

where Co is a large universal constant. 

Our main result on the distributions of edge eigenvectors is the following theorem. 

Theorem 1.6 (Universality of edge eigenvectors). Let H v and H w both satisfy Assumptions (A) 
- (C) as well as the uniform subexponential decay condition (|1.9|) . Let E v and E w denote the expectations 
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with respect to these collections of random variables. Suppose that the level repulsion estimate (| 1 .4|) holds 
for the ensemble H v . Assume that the first two moments of the entries of H v and H™ are the same, i.e. 

E v h l lJ h^ = E^hljhfj for 0^l + u^2. (1.10) 

Let p be a positive constant. Then for any integer k and any choice of indices i\, . . . i k , ji, ■ ■ ■ ,jk> fix, ■ ■ ■ fik 
and a>i, . . .au with min(|a;|, \ai — N\) + min(|/3;|, — N\) ^ ip p N for all I we have 

lim [E v -E w Wa/ 2 / 3 (A 01 ~ lf}l ),...,N 2 ^{\ Pk - 7/3 J ; Nu^i^u^), . . . , Nu ak (i k )u ak {j k j) = 0, 

(i.n) 

where 9 is a smooth function that satisfies 

\d n 9(x)\ ^ C{l + \x\f (1.12) 

for some arbitrary C and all n & N 2k satisfying \n\ ^ 3. The convergence is uniform in all the parameters 
il,jl,ai,pi satisfying the above conditions. 

Remark 1.7. The scaling in front of the arguments in (|1.11[) is the natural scaling near the spectral edge. 
Indeed, for e.g. GUE or GOE it is known (see e.g. [1]) that (Xp — yp) ~ N~ 2 ^ 3 near the edge, and that 
u a (i) ~ N 1 / 2 (complete delocalization of eigenvectors). 

Remark 1.8. The form (|l.lip characterizes the distribution of the edge eigenvectors completely. Choosing 
h = ji yields the modulus |u Q; (i/)| 2 ; fixing z; and varying ji gives the relative phases of the entries of the 
vector u ai , which is only defined up to a global phase. 

Remark 1.9. Theorem ll.6l and Rcmark [l.5l implv that the joint eigenvector-eigenvalue distribution of Hcrmi- 
tian Wigner matrices agrees with that of GUE. In the case of GUE, it is well known that the joint distribution 
of the eigenvalues is given by the Airy kernel [29] . The eigenvectors are independent of the eigenvalues, and 
the matrix (u a (i)) a ^ of the eigenvector entries is distributed according to the Haar measure on the unitary 
group U(N). In particular, any eigenvector u Q is uniformly distributed on the unit (N — l)-sphere. 

Similarly, Theorem 11.61 and Remark ll.5l imply that the joint eigenvector-eigenvalue distribution of sym- 
metric Wigner matrices agrees with that of GOE. Results similar to those outlined above on the eigenvector- 
eigenvalue distribution of GUE hold for GOE. 

The universality of the eigenvalue distributions near the edge was already proved in [T7] under the as- 
sumption that the first two moments of the matrix entries match, and in [57] under the additional assumption 
that the third moments vanish. Note that Theorem 11.61 holds in a stronger sense than the result in [17]: it 
holds for probability density functions, not just the distribution functions. 

In the bulk, a result similar to Theorem 11.61 holds under the stronger assumption that four, instead of 
two, moments of the matrix entries match. 

Theorem 1.10 (Universality of bulk eigenvectors). Let H v and H w both satisfy Assumptions (A) 
- (C) as well as the uniform subexponential decay condition (|1.9|) . Suppose that the level repulsion estimate 
(|1.5j) holds for the ensemble H v . Suppose moreover that the first four off-diagonal moments of H v and _ff w 
are the same, i.e. 

E v h\ ] h'^ j = E^h^hfj for i^j and 0^( + m^4, (1.13) 
and that the first two diagonal moments of H v and iJ w are the same, i. e. 

E v h\X 3 = ^hiiKj for 0s$/ + «^2. (1.14) 
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Let p > be fixed. Then for any integer k and any choice of indices i\,...ik, ji, ■ ■ ■ ,jk, o,s well as 
pN < ai, . . . afc,/3i, . ..,/3fc < (1 - p)A, we We 

lim [E v -E w ]0(iV(A ft -7 / 3j,...,iV(A fc -7£j;i^ =0, (1.15) 

where is a smooth function that satisfies 

\d n 9{x)\ < C(l + |x|) C (1.16) 

for some arbitrary C and all n £ N 2fc satisfying \n\ ^ 5. XTie convergence is uniform in all the parameters 
Hi jucth Pi satisfying the above conditions. 

The universality restricted to the bulk eigenvalues only has been previously established in several works. 
The following list provides a summary. Note that the small-scale statistics of the eigenvalues may be studied 
using correlation functions, which depend only on eigenvalue differences, or using joint distribution functions, 
as in (jl.lip and (|1.15[) . which in addition contain information about the eigenvalue locations. 

(i) In |17j . bulk universality for generalized Wigner matrices was proved in the sense that correlation 
functions of bulk eigenvalues, averaged over a spectral window of size N e , converge to those of the 
corresponding Gaussian ensemble. 

(ii) In [26] the statement (|1.15[) on distribution functions, restricted to eigenvalues only, was proved for 
Hcrmitian and symmetric Wigner matrices for the case where the first four moments match as in (| 1 . 1 3|) . 

(iii) For the case of Hermitian Wigner matrices with a finite Gaussian component, it was proved in |19j 
that the correlation functions converge to those of GUE. 

(iv) In |18j . the joint distribution function of the eigenvalues of GUE was computed. This result was 
extended to cover GOE in [21] . 

Note that (ii) and (iii) together imply the universality of the joint distribution of eigenvalues for Hcrmitian 
Wigner matrices, for which the first three moments match those of GUE and the distribution is supported 
on at least three points. Moreover, combining (ii) and (iv) allows one to compute the eigenvalue distribution 
of Hcrmitian and symmetric Wigner matrices, provided the four first moments match those of GUE/GOE. 

Thus, Theorem 11.151 extends the results of [17) to distribution functions of individual eigenvalues as well 
as to eigenvectors. 

Remark 1.11. A while after this paper was posted online, a result similar to Theorem 11.101 appeared in [25] . 
Its proof relies on a different method. The hypotheses of [28] are similar to those of Theorem 11.101 with 
the two following exceptions. The result of [28] is restricted to Wigner matrices instead of the generalized 
Wigner matrices defined by Assumptions (A) - (C). Moreover, in [25] the derivatives of the observable 9 
are required to be uniformly bounded in x, where this uniform bound may grow slowly with N. This latter 
restriction allows the authors of [25] to let fc grow slowly with N. 

While the results of [25] apply to eigenvectors near the spectral edge, the matching of four moments (as in 
Theorem 1 1.1 Op is also required for this case. As shown in Theorem II .61 the universality of edge eigenvectors 
in fact only requires the first two moments to match. 
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1.3. Outline of the proof. The main idea behind our proof is to express the eigenvector components 
using matrix elements of the Green function G(z) = (H — z)^ 1 . To this end, we use the identity 



E 



77 /lT N 

(p nn 2 nu p (i)u (j) = —(GiiiE + i^-G^E-in)), (1.17) 

{£j — Ap) + 77 Z7T1 

where 77 > 0. Using a good control on the matrix elements of G{z), we may then apply a Green function 
comparison argument (similar to the Lindeberg replacement strategy) to complete the proof. For definiteness, 
let us consider a single eigenvalue A Q located close to the spectral edge —2. 

In a first step, we write Nu a (i)u a (j) as an integral of (|1.17j) over an appropriately chosen (random) 
domain, up to a negligible error term. We choose 77 in (|1.17j) to be much smaller than the typical eigenvalue 
separation, i.e. we set 77 = N~ 2 ^ 3 ~ £ for some small e > 0. Note that the fraction on the left-hand side 
of (|1.17[) is an approximate delta function on the scale 77. Then the idea is to integrate f| 1 . 1 T[) over the 
interval [X a — if > 77, X a + p c rf\ for some large enough constant C. For technical reasons related to the Green 
function comparison (the third step below), it turns out to be advantageous to replace the above interval with 
I := [A Q _i +Lp c rj 7 X a + ip c rj\. Using eigenvalue repulsion, we infer that, with sufficiently high probability, the 
eigenvalues A a _i and A a +i are located at a distance greater than (p 77 from X a . Therefore the ^-integration 
over I of the right-hand side of (j!.17[) yields N u a (i)u a (j) up to a negligible error term. 

In a second step, we replace the sharp indicator function 1(E £ I) with a smoothed indicator function 
expressed in terms of the Green function G. This is necessary for the Green function comparison argument, 
which requires all i7-dependence to be expressed using Green functions. To that end, we choose a scale 
t) := iY~ 2/3 ~ 6,r < 77 and write 



(1.18) 



where the error is negligible. Here El = — 2 — (p c N~ 2 / 3 , q is a smooth function equal to 1 in the 1/3- 
neighbourhood of a — 1 and vanishing outside the 2/3-neighbourhood of a — 1, and 8^ is the approximate 
delta function defined in (|2.16[) below. Thanks to the special form of the right-hand side of (|2.16[) . we have 
9 V (H) = — ImG(i7?). Hence the argument on right-hand side of ()1.18[) may be expressed as an integral over 
Green functions. Thus we have expressed Nu a (i)u a (j) using matrix elements of G alone. Note that the 
above choice of I was made precisely so as to make the right-hand side of (|1.18|) a simple function of G. 

In a third step, we use a Green function comparison argument to compare the distributions of Nu a (i)u a (j) 
under the two ensembles H v and ff w . The basic strategy is similar to [17], but requires a more involved 
analysis of the resolvent expansion. The reason for this is that we need to exploit the smallness associated 
with off-diagonal elements of G, which requires us to keep track of their number in the power counting. This 
bookkeeping is complicated by the presence of the two fixed indices i and j. Another important ingredient 
in the error estimates of the Green function comparison argument is the restriction of the integration over 
£ to a deterministic interval of size ip N~ 2 / 3 around -2. This can be done with negligible errors using the 
eigenvalue rigidity proved in |17) ; see Theorem 12.21 

The above proof may be easily generalized to multiple eigenvector components as well as to eigenvalues; 
this allows us to consider observables of the form given in (|1.11[) . The necessary changes are given in Section 
II 

The proof for bulk eigenvectors is similar, with two major differences. At the edge, the convolution 
integral on the right-hand side of (|1.18p was over a domain of size <p c N~ 2 / 3 . If the same expression were 
used in the bulk, this size would be O(l) (since E is separated from the spectral edge —2 by a distance of 
order 0(1)), which is not affordable in the error estimates. Instead, a more refined multiscale approach using 
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the Hclffcr-Sjostrand functional calculus is required in order to rewrite the sharp indicator function on the 
left-hand side of (|1.18p in terms of Green functions. The second major difference for bulk eigenvectors is the 
power counting in the Green function comparison argument, which is in fact easier than at the edge. The 
main reason for this is that the smallness associated with off-diagonal elements of G is not available in the 
bulk. Hence we need to assume that four instead of two moments match, and the intricate bookkeeping of 
the number of off-diagonal resolvent elements is not required. Thus, thanks to the very strong assumption 
of four- moment matching, the proof of Theorem 11.101 is considerably simpler than that of Theorem 11.61 Sec 
Section [5] for a more detailed explanation as well the proof. 

Conventions. We shall use the letters C and c to denote generic positive constants, which may depend on 
fixed quantities such as "d from (jl.9[) , S± from Assumption (B), and Co from Assumption (C). We use C for 
large constants and c for small constants. 

Acknowledgements. The authors would like to thank L. Erdds and H.T. Yau for many insights and helpful 
discussions. 



2. Local semicircle law and rigidity of eigenvalues 

In this preliminary section we collect the main tools we shall need for our proof. We begin by introducing 
some notation and by recalling the basic results from [17] on the local semicircle law and the rigidity of 
eigenvalues. 

We define the Green function of H by 

GM = (jj^l) . (2-1) 

where we the spectral parameter x — E + irj satisfies E <G K and rj > 0. The Stieltjes transform of the 
empirical eigenvalue distribution of H is defined as 

m( Z ) := l5>(z) = = ^Ex"b- 

i a 

Similarly, we define m sc (z) as the Stieltjes transform of the local semicircle law: 

m sc {z) := J . 

It is well known that m sc (z) can also be characterized as the unique solution of 

m sc (z) + 1 - = (2.3) 
z + m sc (z) 

with positive imaginary part for all z with Imz > 0. Thus, 



-z + Vz 2 -4 

m sc (z) = , (2.4) 

where the square root function is chosen with a branch cut in the segment [—2, 2] so that asymptotically 
\/ z 2 — 4 ~ z at infinity. This guarantees that the imaginary part of m sc is non- negative for r\ = Imz > 
and in the limit 77 — ^ 0. 
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In order to state the local semicircle law, we introduce the control parameters 

A d := max | Gu - m sc | , A„ := max|Gy-|, A := \m — m ac \, (2.5) 

where the subscripts refer to "diagonal" and "off-diagonal" matrix elements. All these quantities depend on 
the spectral parameter z and on N, but for simplicity we often omit the explicit mention of this dependence 
from the notation. The following two results were proved in |17) . 

Theorem 2.1 (Strong local semicircle law). Let H = (h VJ ) be a Hermitian or symmetric N x N 
random matrix satisfying Assumptions A C. Suppose that the distributions of the matrix elements hij 
have a uniformly subexponential decay in the sense of (|1.9p . Then there exist positive constants Aq > 1. 
C, c, and r < 1, such that the following estimates hold for L as in Definition \l.l\ and for N iVo($, Co,6±) 
large enough. 

(i) The Stieltjes transform of the empirical eigenvalue distribution of H satisfies 

P (U { A (*)>^^H) ^(^r\ (2.6) 

where 

S L := {z = E + in : |£|^5, AT'^log AT) 10i < v <: 10 j . (2.7) 
(ii) The individual matrix elements of the Green function satisfy 

F ( U {M'HAoM > (log^y^^ + ^i^}) < e-O*"^ . (2.8) 

(Hi) The norm of H is bounded by 2 + Af~ 2 / 3 (log N) 9L in the sense that 

P(\\H\\ > 2 + Af" 2/3 (logAr) 9i ) e -=(i°gJV) Tl . (2.9) 

The local semicircle law implies that the eigenvalues are close to their classical locations with high 
probability. Recall that Ai A2 • • • ^ An are the ordered eigenvalues of H . The classical location j a of 
the ct-th eigenvalue was defined in (|1.7[) . 

Theorem 2.2 (Rigidity of eigenvalues). Under the assumptions of Theorem \2.1\ there exist positive 
constants Aq > 1, C,c, and r < 1, depending only on $ in (|1.9|) . S± in Assumption (B), and Cq in 
Assumption (C), such that such that 

P{3a : |A Q -7a| > (logAr) L [min(a,Ar-a + l)] _1/ 'V- 2 / 3 } e -c(lo S Ny L j (21Q) 

where L is given in Definition ll.ll 

A simple consequence of Theorem 1 2. II is that the eigenvectors of H are completely delocalizcd. 
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Theorem 2.3 (Complete delocalization of eigenvectors). Under the assumptions of Theorem [Ql 
we have 

'sa,i : Mi)| 2 ^} < e~^ N y L (2.11) 
for some positive constants C and c. 

PROOF. Using (|2.8p and (|2.9j) wc have, with probability greater than 1 - e~ c Q° sN ) T , 

C > Im G «(A Q + i„) = E rA 2 > ^ 

^ (A Q - A^) 2 + n- n 

Choosing r\ = N~ 1 (logN) 20L yields the claim. □ 

2.1. Stability of the level repulsion condition. In this section we prove that level repulsion, in the 
sense of (|1.4p (respectively (|1.5p ). holds for the ensemble i? w provided it holds for the ensemble H v and the 
first two (respectively four) moments of the entries of H v and H™ match. 

Proposition 2.4 (Stability of level repulsion at the edge). Let H v and H w both satisfy Assump- 
tions (A) - (C) as well as the uniform subexponential decay condition (|1.9[) . Assume moreover that the 
first two moments of the entries of H v and H w are the same, in the sense of (jl.lOp . If the level repulsion 
estimate (|1 .4[) holds for H v then it holds for i? w . 

Proposition 2.5 (Stability of level repulsion in the bulk). Let H v and H w both satisfy Assump- 
tions (A) - (C) as well as the uniform subexponential decay condition (|1.9[) . Assume moreover that the first 
four moments of the entries of H v and i? w are the same, in the sense of (|1.13|) and (|1.14p . If the level 
repulsion estimate 111.50 holds for H v then it holds for H w . 



The proofs of Propositions 12 .41 and 12.51 are very similar. For definiteness, we give the details for the edge 
case (Proposition I2.4p . The rest of this section is devoted to the proof of Proposition ^. 41 The main tool is 
the following Green function comparison theorem, which was proved in |17j . Theorem 6.3. 

Lemma 2.6 (Green function COMPARISON theorem at the edge). Suppose that the assumptions of 
Theorem \2.1\ hold for both ensembles H v and iJ w . Let F : R — > K be a function whose derivatives satisfy 



max 



\F in \x)\(l + \x\)- Cl ^ d for n = l,2,3,4, (2.12) 



with some constant C\ > 0. Then there exists a constant Sq > 0, depending only on C\, such that for any 
e < Eq and for any real numbers E\ and E2 satisfying 

\Ex + 2| < N~ 2/3+e , \E 2 + 2| < N~ 2/3+e , 

we have 

< CA^ 1/6+c,e (2.13) 



and 



[E V -E W ] J F 1 ^ J 2 dy lmm(y + iN- 2/3 - £ ) 

[E v -E w ]f(7V7/ I mm (E 1 +iiV" 2/3+6 )) < CN- l ' &+Ce , (2.14) 

for some constant C and large enough N , depending only on C\, •& in (|1.9p . 8± in Assumption (B), and Co 
in Assumption (C). 
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The basic idea behind the proof of Proposition 12.41 is to first cast the level repulsion estimate into an 
estimate in terms of Green functions and then use the Green function comparison theorem. Recalling L from 
Dcfinition ll.il we set 

E L := -2-2(logN) L N- 2/3 . (2.15) 

For any E > E L let 

XE ■= ~L[E L ,E\ 

be the characteristic function of the interval [El, E]. For any 77 > we define the approximate delta function 
Or, on the scale r\ through 

O n (x) := J 1 = Ilm— ^. (2.16) 
ir{x z + n z ) 7r x — IT] 

The following result provides a tool for estimating the number operator using Green functions. It is proved 
in [T7], Lemma 6.1 and Corollary 6.2. 

Lemma 2.7. Suppose that the assumptions of Theorem \2.1\ hold, and let Aq and r be as in Theorem \2.1\ 
For any e > 0, set l\ :~ J\l~ 2 / 3 ~ 3e a nd r\ :~ N~ 2 ^~ 9e . Then there exist constants C,c such that, for any E 
satisfying 

\E + 2\N 2 ' 3 < |(logJV) L , (2.17) 

we have 

\Tv XE (H)-TY X E*O r ,(H)\ ^ C(N- 2£ +Af(E-£ 1 ,E + £ 1 )) (2.18) 

with high probability. 

Moreover, let I := ^£iN 2e = i_/V~ 2 / 3 ~ £ . Then under the above assumptions the inequalities 

Tr( X E-i*O ri )(H)-N- s «C tf(-oo,E) Tr( X E+e * V ){H) + N~ e (2.19) 
hold with high probability. 

After these preparations we may complete the proof of Proposition ^. 41 



PROOF of Proposition 12.41 Assume that H v satisfies the level repulsion assumption (|1.4j) with constant 
aQ. We shall show that H™ satisfies also satisfies Definition (|1.4|) with the same constant ao. Fix a satisfying 
< a ^ ao , and let S > be as chosen so that (|1.4|) holds for the ensemble H v . 

Abbreviate E± = E ± iV- 2 /3-« an( j set £ .- 2a. By using (pTTOj) for E = E + and E = £L, and 
subtracting the resulting two inequalities, we get, with high probability, 

Tr(l[ E -+e,E + -e\ * 0„)(H) - 2N~ £ < N{E_,E+) < Tr(l [E __ i>E++i] * V )(H) + 2N~ e . (2.20) 

Let F be a nonnegativc increasing smooth function satisfying F(x) = 1 for x ^ 2 and F(x) = for x ^ 3/2. 
Then, using (|2.20j) and Lemma \2~M we have 

E^F(Af(E^,E+)) < E v >F(Tr(l [E __^ E++i] *0 v )(H)+2N- £ ) 

< E v F(Tr(l [E __ iyE++e] * 9 V )(H)) + CN~ e + CN- 1 '^ 

^ W F(N(E- - 21, E+ + 21) + N- £ ) + CN~ e + CN- 1/6+Ce 

< N- a - s + CN- £ + CN- 1/6+Ce . 

Since e = 2a, we get that (|1.4[) holds for the ensemble H™ with exponent 5' = min{5, a} > 0. □ 
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3. Proof of Theorem 11.61 



To simplify presentation, in this section we prove Theorem ll.6l in the special case 9 = 0(Nu a (i)u a (j)) , where 
a ^ ip p . The proof of the general case is analogous; see Section @] for more details. 

In a first step we convert the eigenvector problem into a problem involving the Green function Gij . To 
that end, we define 



Gij(z) := ^(G y (z) - Gij(z)) = G lk (z)G jk (z) . 



(3.1) 



where the second equality follows easily by spectral decomposition, Gij(z) = J2p "^-z^ - 



Note that 



Gi^E + i-q) = (jE _ A J )2 + ?? 2 ut)(i)up(J) 

as well as 1m Gu(z) = Gu{z). It is a triviality that all of the results from Section [2] hold with z replaced 
with z. 

The following lemma expresses the eigenvector components as an integral of the Green function over an 
appropriate random interval. 



Lemma 3.1. Under the assumptions of Theorem ] 1.61 for any e > there exist constants C\, Ci such that 
for r\ = N~ 2 / 3 ~ 6 we have 



Je u i 



lim max max E u 6 (Nu a (i)u a (j)) -E u 6 

N—>oo a^tpP i,j I 



Gij(E + itj) l(Aa_i < E~ «S A a ) dE 



where 



E ± := E±if Cl r/ 



I := 



-2 - N~ 2/3 ip C2 , -2 + N~ 2/3 ip C2 



0, (3.2) 
(3.3) 



and we introduce the convention X := —00. Here u stands for either v or w. 



Proof. We shall fix i, j and a ^ tp p ; it is easy to check that all constants in the following are uniform in 
and a ^ ip p . We write 



U a (i)Ua(j) = 



r) f u a (i)u a (j) 



dE. 



Using Theorem 12.31 it is easy to prove that for C\ large enough we have 

u a {i)u a (j) 



u a (i)u a {j) 



i) 



it J a (E - X a ) 2 + rf 



dE + O 



Nip 



Ci/2 



holds with high probability for some c > 0, as long as 

o < A~, b ^ A+, 
where we use the notation (|3.3[) . i.e. := A Q ± ip Cl r]. We now choose 



(3.4) 



(3.5) 



(3.6) 



in{^a»^-i}> b '■= K 



12 



By the assumption on 9 and using Theorem 12.31 we therefore find 



Now we split 



to get 



AE = I + d£H-l(A+_!>A-) / d£ 



E«fl(JV« a (<KCj)) = ( ^ / A " -M^#^d£ ] + 0(^°E U 1(A£_ 1 > A")) + (1) (3. 



for some constant Co, where we used Theorem 12.31 and the assumption on 9. Now the level repulsion 
estimate ()1.6|) implies that the second term of ()3.8[) is o(l). We now observe that, by f)2 . 10[) . we have 
A+ < -2 + N- 2 ^Lp C2 and A+_ x ^ -2 - N~ 2 / 3 ip C2 with high probability. It therefore easy to see that 

r«(jvs a (i) Uf ,(j)) =E u e(^l f M»KO) 1(Aa i ^ g _ < Aa)dj; \ +q(1) (3 9) 

Next, we replace the integrand in p.9[) by Gij(E + irj). By definition, we have 

+ - 2^ (f?-A^ + ^ + (E-x a y + ,f ■ (3 - 10) 

In order to be able to apply the mean value theorem to 9 with the decomposition (|3.10p . we need an upper 
bound on 

Nrj I m)Mi)\ dE ^ a.** + ^ E f V dE t (3 . u) 



where the inequality holds with high probability for any C3; here we used Theorem 12.31 Using 7^ ^ 
—2 + c(/3/iV) 2 / 3 as well as (|2.10[) , we find with high probability for large enough C3 

Thus the left-hand side of (|3~TTj) is bounded by ip Co+C3+1 . 

Let us abbreviate x(^) := l(A a — 1 ^ E~ ^ A Q ). Now, recalling the assumption on 9, we may apply the 
mean value theorem as well as Theorem 12.31 to get 
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for some constant C ^ C(Co + C3 + 1) independent of C\. We now estimate the right-hand side of (|3.13l) . 
Exactly as in (|3.12p . one finds that there exists C4 such that the contribution of (3 ^ ip Ci to the right-hand 
side of p,13[) vanishes in the limit N — > 00. Next, we deal with the eigenvalues (3 < a (in the case a > 1). 
Using Theorem 12.31 we get 

where c > for Ci large enough. 

What remains is the estimate of the terms a < (3 ^ ip Cl in (|3.13p . For a given constant C5 > we 
partition I = Ii U I2 with ^ fl /2 = and 

Ji := {£eJ:3^,a<K/ 4 , |£-A^ W Cs }. (3.14) 
It is easy to see that, for large enough C5, we have 

where c > 0. Let us therefore consider the integral over Jx. One readily finds, for A Q ^ A a +x ^ A^, that 

1(£T<A Q ) «C \ ^ ■ „ < (3.15) 



( J B-A /3 )2+ 7? 2 V - <*/ - {X ^_ Xa) 2 +r) 2 - (A Q+1 -A Q )2 +?7 2 

From Theorem 12.31 we therefore find that there exists a constant C§, depending on C±, such that 

P:a<PS<p<>* JlA* *P) +T I (A Q+ i-A«) +77 

The right-hand side of ([3~16| is bounded by E u l(|A Q+ i - A„| < A^V 72 ) + 0(N- e ). Using (H) we now 
obtain 

/3 : a</3^</3 c 4 1 

where c > 0. This concludes the proof. □ 

In a second step we convert the cutoff function in lemma 13.11 into a function of dj . 

Lemma 3.2. Recall the definition (|2.16[) of the approximate delta function 9 n on the scale ?y. Let a ^ ip p and 
q = q a : K — > R + 6e a smooth cutoff function concentrated around a — 1, satisfying 

q(x) = l if |x-a + l| < 1/3, q(x) = i/ |x - a + 1| ^ 2/3 . 

X := I^.b-], := -2-27V- 2 / 3 (logiV) L (3.18) 

where 

rj := N~ 2 / 3 - E , 77 := AT" 2 / 3 " 66 (3.19) 
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for e > 0. Then for e small enough we have (recall the definition (|3.3I) ) 



lim max max 

TV— >oo a^ipP i.j | 



(e u 6>( 



Nu a (i)u a {j)) -E u < 



N 

n 



G ij (E + i V )q[Tr( X *e fj )(H)) dE 



(3.20) 



Here u stands for either v or w. 
Proof. Note first that 

N 
n 



N 

7T 
N 

ft Jl 



— I Gij{E + in)H\x-i<E- ^Xc)dE = — j Gij(E + irj) l(J\f(—oo, E~) = a — l)dE 



with high probability. 

Next, recall that (|2TTg|) asserts that for I = N^ 2 / 3 ^ we have 

\Trx{H)-Tr{x*0ij)(H)\ C (iV~ e + Af(E~ - t, E~ + £)) 

with high probability for sufficiently large N . We therefore find that 



(3.21) 



- / Gij(E + in) l(Xa-i < E~ < Xa) dE - - / + iv)q[Tr( X * e n ){H)} dE 

ft J i ft Ji 

N r ~ 

^ CN zl / 1^(^ + 177)11(1-^- - Xp\^£)dE + (p c N~ E 
p=i ^ 

v ° r ~ 

^ CN Y. \ G a(E + ^)m\E- - X \ ^ £) dE + <p c N~ E 

8=1 J 1 



Cats 



s: Cip u N£ sup I dj (E + ir/)| + tp N 
Eel 



holds with high probability, where in the first inequality we estimated the integral Jj\Gij(E + ir]) \ dE exactly 
as p. lip , and in the second inequality we used (|2.10[) . Using the definition of / and (|2.8|) we get 



sup | 
Eel 



Together with (|3.2p , the claim follows. □ 
In a third and final step, we use the Green function comparison method to show the following statement. 
Lemma 3.3. Under the assumptions of Lemma \3.S\, we have 



lim max (E v - E v 

N— >oo i,j 



N f ~ 

- \ G ij (E + ir 1 )q[Tr(x*e fj )(H)] dE 
ft Ji 



0. 



The rest of this section is devoted to the proof of Lemma 13.3 
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3.1. Green function comparison: proof of Lemma 13. 31 The claimed uniformity in i and j is easy to 
check in our proof, and we shall not mention it anymore. Throughout the following we rename i = a and 
j = [3 in order to use i and j as summation indices. We now fix a and (3 for the whole proof. (Note that a 
and (3 need not be different.) 
We use the identity (see (|3.1|l ) 



Gij(z) = (Imz) ^Xij t k(z) , x ij,k( z ) := G ik (z)Gjk{z) . 



(3.22) 



We begin by dropping the diagonal terms in (|3.22[) . 
Lemma 3.4. For small enough e > we have 

- f G a0 (E + ir,)q[Tr( X *e fj )(H)] dE 



x{E)q{y{E))dE 



where u stands for either v or w, and 
Nn 



x(E) : 
PROOF. We estimate 



J2 X a ^ k (E + b]) 



y(E) 



k^a,/3 



/ Y,X iitk (E + ir))dE. 
•' El i^k 



(3.23) 



(3.24) 



N 



G a0 {E + ir))-x(E) 



sC tp c Nr] «C (p c N 1/3 - e 



with high probability and, recalling that q' is bounded, 

q[Tr(X*Ofj)(H)]-y(E) ^ tp c 'rjN N~ 2/3 sC tp c N~ 1/3 - 6£ 

with high probability. Therefore the difference of the arguments of 9 in (|3.23l) is bounded by ip c N~ x / 3 ~ £ 
with high probability. (Recall that |/| ^ ip c N~ 2 ^ 3 .) Moreover, since q is bounded, it is easy to see that 
both arguments of 9 in (|3.23|) arc bounded with high probability by 

ip c Ni 1 N~ 2/3 (l + N sup Al(E + iri)) ^ ip c N £ , 
Eel 

where we used Theorem 12.11 The claim now follows from the mean value theorem and the assumption on 
9. □ 

For the following we work on the product probability space of the ensembles H v and 7J W . To distinguish 
them we denote the elements of H v by N x / 2 Vij and the elements of ff w by N~ 1 / 2 Wij. We fix a bijective 
ordering map <f> on the index set of the independent matrix elements, 



$ : {(i,j) : 1 < i < j < N} -> {1,. . . , 7 max} , 7n 



N(N+ 1) 



and denote by H 1 the generalized Wigner matrix whose matrix elements hij follow the ^-distribution if 



*S 7 an( i the w-distribution otherwise. In particular, Hq = H v and H~ t 



H w . Hence 



E v - E v 



x{E)q{y{E))dE 



g r E (ff,-0 



7=1 



x(E)q(y(E))dE 



(3.25) 



1G 



(in self-explanatory notation). 

Let us now fix a 7 and let (a, b) be determined by $(a, b) = 7. Throughout the following we consider 
a, (3, a, b to be arbitrary but fixed and often omit dependence on them from the notation. Our strategy is to 
compare i? 7 _i with for each 7. In the end we shall sum up the differences in the telescopic sum (|3.25p . 

Note that Hj-\ and H 7 differ only in the matrix elements indexed by (a,b) and (b,a). Let E^' denote 
the matrix whose matrix elements are zero everywhere except at position where it is 1; in other words, 
E^p = SikSji. Thus we have 

Hy-i = Q + ^F=V , V := v ab E^ + v ba E^ , 
VN 



H 1 = Q + -^W, W := Wab EW +w ba E™ . (3.26) 
v N 



Here Q is the matrix obtained from H 1 (or, equivalently, from Hj-i) by setting the matrix elements indexed 
by (a, b) and (b, a) to zero. Next, we define the Green functions 

R := S := — . (3.27) 

Q — z — z 

We shall show that the difference between the expectation E^- 1 ) and E^ depends only on the second 
moments of v ab , up to an error term that is affordable even after summation over 7. Together with same 
argument applied to "& H ~i\ and the fact that the second moments of v ab and w ab are identical, this will 
prove Lemma 13.31 

For the estimates we need the following basic result, proved in [17] (Equation (6.32)). 
Lemma 3.5. For any rf := jV~ 2 / 3 ~ d we have with high probability 

sup m&x\Rij(E + U1') - 5 ijmsc (E + iri')\ sC A s := N~ l/3+2S . 

E ^ N -2/3 + c i,j 

The same estimates hold for S instead of R. 

Our comparison is based on the resolvent expansion 

S = R- N~ 1/2 RVR + N^iRVfR- N- 3/2 (RV) 3 R + N- 2 (RV) 4 S. (3.28) 
Using Lemma 13.51 we easily get with high probability, for i ^ j, 

\Sij-Rij\ < f C N^/ 2 A 2 - r where r := l(i e {a, b}) + l(j e {a, b}) . (3.29) 

Defining 

&Xij_k '■= SikSjk — RikRjk ) (3.30) 
we therefore have the trivial bound with high probability 

\AX ijtk \ < <p c N-V 2 A 3 e - s (k^i,j), (3.31) 

where we abbreviated 

s := max|l(i e {a, b}) + l(k € {a, b}) , l(j G {a, 6}) + l(fcG{a,6})} 

= l({i,i}n{a,&}^0)+l(feG {a, 6}). (3.32) 
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The variable s counts the maximum number of diagonal resolvent matrix elements in AXij_j~- The bookkeep- 
ing of s will play a crucial role in our proof, since the smallness associated with off-diagonal elements (see 
Lemma l3.5p is needed to control the resolvent expansion p.28[) under the two-moment matching assumption. 

From now on it is convenient to modify slightly our notation, and to write 'EF(Hj) instead of E,( H ~<> F(H). 
We also use E a & to denote partial expectation obtained by integrating out the variables v a b and w a b- 

By applying (|3.28[) to (|3.30p and taking the partial expectation E a b, one finds, as above, that there exists 
a random variable A\, which depends on the randomness only through Q and the first two moments of v a b, 
such that for k ^ i,j and s as in p.32[) we have with high probability 

|E Qb AX^ fe - A x \ < ^N-^A 3 -* . (3.33) 

Using this bound we may estimate 

Ax(E) := x s (E)-x R (E), Ay(E) := y s (E)-y R (E), (3.34) 



with the convention that a superscript S denotes a quantity defined in terms of the matrix i? 7 _i, and a 
superscript R a quantity defined in terms of the matrix Q. 

Lemma 3.6. For fixed a, /3, a, b there exists exists a random variable A, which depends on the randomness 
only through Q and the first two moments of v a b, such that 



El 



x s (E)q(y s (E))dE 



-E0 



x R (E)q(y R (E))dE = A + o(N- 2+t +N- 2+1(a=b) ) , (3.35) 



where t := |{a, b} n {a, /3}\ . 

Before proving Lemma 13.61 we show how it implies Lemma 13.3 



PROOF of Lemma 13.31 It suffices to prove that each summand in (|3.25p is bounded by o(iV~ 2+ * + 7V~ 2+1(a=b )) . 
This follows immediately by applying Lemma [3~B1 to S = (-ff 7 -i — z)^ 1 and S' := {H 1 — z)^ 1 and subtracting 
the statements; note that the random variables A in the statement of Lemma 13.61 are by definition the same 
for S and 5'. □ 



Proof of Lemma 13.61 Throughout the proof of Lemma [3~6l we shall abbreviate H = Fl 1 -\ = (h[j), as well 
as S = S(z) = (H - z)' 1 . 

Since E S I (recall (|3.3[1 ) we get from Theorem 12.11 that with high probability 

atCe 

\x(E)\ < tp c N 2 r)A 2 ^ , (3.36) 

V 

which implies 

J \x(E)\dE < N Ce . (3.37) 

Here we adopt the convention that if x or y appears without a superscript, the claim holds for both super- 
scripts R and S. Similarly, we find with high probability 

\y(E)\ < fjip c N~ 2/3 N 2 Al E < N Ce . (3.38) 
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Next, in the definition of x{E) and y{E) we condition on the variable s defined in (|3.32l) by introducing, 
for s = 0,1,2, 



x s (E) 
Vs{E) 



Nr) 
n 



X a p, k (E + ir 1 )l(s = l({a,l3}n{a,b}^®)+l(ke{a,b})) , 

E~ 



As above, s is a bookkeeping index that bounds the number of diagonal resolvent matrix elements appearing 
in the resolvent expansion. 

We abbreviate Ax s (E) := xf(E) — xf(E) and Ay s (E) = yf(E) — y R (E). Recalling the definition 
t = | {a, b} n {a, /3}|, we find with high probability 



\Ax s (E)\ < ^AVV-^AS-'W 1 ^ 1 ^ )) < 



jy3/2-t-Ce ' 



(3.39) 



where we used Theorem 12.11 and the elementary inequality s + l(s = l(t > 0)) ^ t + 1 which holds if 
x s {E) ^ 0. Thus we get with high probability 



J^\Ax s (E)\dE < 



jy-3/2-t-Ce 



^y-5/6jy-2s/3+t+C£ 



(3.40) 



Now we may argue similarly to (|3.33[) . We find that, for any ^-dependent random variable / = f(E) 
independent of h a b, there exists a random variable A%, which depends on the randomness only through Q, 
f, and the first two moments of h a b, such that with high probability 



(E ab Ax s (E))f(E)dE~ A 2 



l(fi) s? ||/ 1(0)1100 jv-ii/6 jv-a«/a+*+cre 



(3.41) 



where fl is any event. Note that, as in (|3.33[) . we find that (|3.41j) is suppressed by a factor iV _1 compared to 
(|3.3ip . This may be easily understood, as the leading order error term in the resolvent expansion of (|3.31|) 
is of order 1 in H , whereas the leading order error term in (|3.41[) is of order 3 in H . These error terms have 
the same number of off-diagonal elements (estimated using Lemma |3.5[) . and the same entropy factor of the 
summation indices. 

We may derive similar bounds for y s (E). As in (|3.31|) . we have with high probability 



\Ay s (E)\ ^ <p c fjN- 2 / 3 N 2 - s N-V 2 Al- s sC N -6/e N -2./3+Ce 



(3.42) 



Furthermore, we find that there exists an independent random variable As(E), which depends on the 
randomness only through Q and the first two moments of h ab , such that with high probability 



E 



ah 



lJ Ay s (E) - A 3 (E) 



< N~ 11/6 N~ 2s/3+Ce 



(3.43) 



After these preparations, we may now estimate the error resulting from setting h ab to zero in the expres- 
sion ~E9 [fj x{E) q(y(E)) dE] . Recalling the conditioning over s = 0, 1, 2, we find 

x s q(y s ) dE = 9 J (x R + Ax Q + Azi + Ax 2 ) q(y R + Ay Q + A Vl + Ay 2 ) dE 
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here and in the following we omit the argument E unless it is needed. Using (|3.42[) we have with high 
probability 



Ax + A Xl + Ax 2 ) q(y R + Ay a + Ay 1 + Ay 2 ) dE 



^(x R + Ax Q + Ax x + A.t 2 ) (q{y R ) + q'(y R ) (Ay + A Vl ) + q" (y R )(Ay Q ) 2 ) dE 



+ o(N~ 2 ). 



The use of the mean value theorem for e small enough is easy to justify using the assumption on 9 and the 
bounds (|3.37[) and (|3.38p . In the following we shall no longer mention such estimates of the argument of 
derivatives of 9, which can always be easily checked in a similar fashion. 

Recall that an error of order o(N~ 2+t ) is affordable in the error estimate. Thus, using the basic power 
counting given by ([337)) . (|3~38| . (pT40)) . and (|3~42|) . we find with high probability 



x b q{y b )dE 



x H q(y H )dE 



0' 



Jx R q(y R )dE 



1 ( (Ax + A Xl ) q(y R ) + x R q'{y R ) (Ay + A Vl ) + Ax Q q'(y R )Ay + x R q" (y R )(Ay )^ dE 



x R q(y R ) dE 



I (Ax q(y R ) + x R q'(y R )Ay a ) dE 



+ o(N~ z+t ) . (3.44) 



We now start dealing with the individual terms on the right-hand side of (|3.44|) . 

First, we consider the terms containing Axi and Ay±. Applying (|3.41j) and (|3.43l) we find that there 
exists a random variable At, which depends on the randomness only through Q and the first two moments 
of h a b, such that 



ab / {Ax iq (y R )+x R q'(y R )A yi ) dE - A4 



o{N~ z+t ) 



(3.45) 



with high probability. Inserting this into (|3.44j) . we find with high probability 



E ab e 


J^x s q(y s )dE 


-E ab 9 


J x R q{y R )dE 


= E ab 9' 


J x R q{y R )dE 
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' (Ax q(y R ) + x R q'(y R )Ay + Ax q' (y R )Ay + x R q" (y R )(Ay ) 2 ) dE 
J x R q(y R ) dE J (Azq q(y R ) + x R q'(y R )Ay Q ) dE 



Enh 9" 



+ A i + o(N- 2+t ). (3.46) 



Thus we only need to focus on the error terms Axq and Ayo- Note that we have 



A:ro(£) = l(i = 0)— '- V AX aP . k (E + i v ) 

7T — ' 

k^a,/3,a,b 

Ay (E) = fj dE^ Hh k £ {a, b}) AX iitk (E + iff) 



(3.47) 
(3.48) 
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Recall that the (i, j)-component of the resolvent expansion (|3.28[) reads 

Sij = (R - N~ 1/2 RVR + N-\RV) 2 R - N- 3/2 (RV) 3 R + N- 2 {RV) 4 s) . (3.49) 



Now we assume that i ^ j and H {a,&}| = 0. It is easy to see that this assumption holds for any 

matrix element in the formulas (|3.47p and p.48|) . Then we can use Lemma [3.51 to estimate the m-th term 
as follows: 

N- m ^[{RV) m R].. N' m / 2+Ce N- 2 ^ , N- 2 [{RV) 4 S] tj sC N -m+Ce > 
with high probability. 

Next, we apply the resolvent expansion to Xij_ k . Note that in our applications errors of size 0(N~ S ^ 3 ~ C ) 
are affordable in AX^-fc for some c > independent of e (see (|3.23[) and (|3.24I0 . Now let us assume that 
the indices a, b, k satisfy the condition 

(*) {*> i} H {a, b} = and k ^ i, j, a, b. 

In the applications we shall set i — a and j = /3 in (|3.47[) . and i = j in (|3.48[) . In both cases, it is easy to 
check that the condition (*) is satisfied for nonvanishing summands. 

We can therefore separate AJy^ into three parts, indexed according to how many V-matrix elements 
they contain, 

AX ijik = Ax\]\ + AX\ 2 1 + AX® + 0(N' 3+C ^) (3.51) 
with high probability; here we defined 

A*£A : = -N-^RikiRVRljk + iCh, (3-52) 
AX (2) k := N~ 1 R lk (RVRVR) jk + N- 1 {RVR) lk {RVR) jk + [C]i , (3.53) 
AX { ^ k := -N- 3 / 2 Ri k (RVRVRVR) jk - N- 3 / 2 (RVR) ik (RVRVR) jk + [C] a , (3.54) 

where [0\i, I = 1,2, means the complex conjugate of the first I terms on the right-hand side with i and j 
exchanged. Furthermore, it easy to see that the second term on the right-hand side of (|3.54[) is of order 
Q( N -l7/6+Cey Tllus wc find with mgh probability 

-Axf k = N- 3/2 R lk {RVRVRVR) jk + N- 3/2 (RVRVRVR) tk R jk + 0(N- 17/6+Ce ) 

= Y + 0{N- l7 ' 6+Ce ) , (3.55) 

where Y is a finite sum of terms of the form 

N- 3/2 Rik(RjaV ab R bb V ba R aa V ab R bk ) (3.56) 

and terms obtained from (|3 . 56[) by (i) taking the complex conjugate and exchanging i and j, and (ii) 
exchanging a and b. Using Lemma 13.51 we find that (|3.56p is equal to 

N-^ 2 R. lk (R ja V ab R bb V ba R aa V ab R bk ) = N- 3 ' 2 ri[ c R lk R ]a V ab V ba V ab R bk + 0(N- 17 / 6+Ce ) 

with high probability. The splitting p.51|) induces a splitting 

Ax = Ax { o ] + A4 2) + A4 3) + 0(N- 5/3+Cs ) (3.57) 
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with high probability in self-explanatory notation. It is easy to see that 

\AX^\ < 7V-V6+Ce ; | Aaj (2)| ^ N -4/6+Ce } | ^(3) | ^ 

From p.47[) and (|3 . 56[) , we find that Ax$ is a finite sum of terms of the form 



i(* = o) E 



k^a,fi,a,b 



(3.58) 



(3.59) 



with high probability, where the other terms are obtained from (|3.59[) as described after p.56[) . 
Similarly, we find 



Ay = Ay« + Ay^ + Ay^ + 0(iV" 7 / 3 + C£ ) 



and 



-8/6+Ce 



|A^ 3) | < N- 



-11/6+Ce 



(3.60) 



(3.61) 



Now we insert these bounds into (|3.46|) . Recall that the upper index I in Ax^ and Ay^ counts the 
number of ^-matrix elements. Thus we find, recalling (|3.46[) and the power counting estimates (|3.58[) and 
(|3.61j) . that there is a random variable A$, depending on the randomness only through Q and the two first 
moments of h a b, such that 



x b q(y b )dE 



J x R q(y R )dE 
x R q(y R )dE 



I (Ax^q(y R )+x R q'(y R )Ayi 3) dE 

+ A 4 + A 5 + o{N~ 2+t ) . (3.62) 



with high probability. Moreover, by the same power counting estimates we find that the second line of (|3.62[) 
is bounded by o{N~ 1 ). We use this rough bound in the case a = b, and get 



x s q(y s )dE 



x R q(y R ) dE 



l(a^b)E ab 6' 



x R q(y R )dE 



Ax'i ) q{y R )+x R q l {y R )Ay { i ) dE 



+ A 4 + A 5 + o(N- 2+t ) + o(N- 2+lt > a = b) ) (3.63) 



with high probability. 

Hence Lemma 13.61 is proved if we can show that, for a^b, we have 



E 9' 



J x R q(y R ) dE J (A4 3) q(y R ) + x R q'{y R )Ayf ) ) dE 



= o(N- 2 ) 



with high probability. This is proved below. 



(3.64) 
□ 
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Proof of (|3.64[) . We shall prove, for a ^ b, that 

x R q(y R ) dE A4 3) q{y R ) dE 



o(N- 2 ). 

The other term on the left-hand side of (|3.64p is estimated similarly. Let us abbreviate 



(3.65) 



B 1 



x R q(y R ) dE 



(3.66) 



From (|3.36[) and the assumption on 9, we find that \B R \ ^ N Ce with high probability. 



We shall estimate the contribution to (|3.65[) of one term of the form (|3.59[) . Recalling that E a b \V ab 



0(1) and m sc = O(l), we find the bound 



N -l/6+Ce max 



E B R R ak RpaRtk q(y R ) dE 



+ o(N- 2 ) 



<C jV-5/6+Ce max gup 



EB R R ak Ri3 a R bk q(y R ] 



+ o(N- 2 ). (3.67) 



The proof of (|3.64j) is therefore complete if we can show that, assuming the sets {a, (3}, {a}, {&}, {k} are 
disjoint, we have 

ER ak (R Pa R bk )B R q(y R )\ ^ N^ 3 ^ . (3.68) 
In order to prove Q3.68[) . we first use a simple resolvent expansion to show that with high probability 

< N -i/3+Ce 



Rak{Rp a Rbk) B R q(y R ) - S ak (S 0a S bk ) B S q(y S ) 
where B s is defined analogously to (|3.66[) with R replaced by S. Therefore it suffices to prove 

ES ak {SpaS bk )B*q( y a ) s v- 4 / 3+Ce 



(3.69) 



(3.70) 



In order to complete the proof, we introduce some notation. Recall that H = H 1 _i and S = (H — z) — 1 . 
We define as the matrix obtained from H by setting its a-th column and a-th row to be zero. For 

any function F = F{H) we define F^ := F(H^). We now remove the a-th row and column from H in 
(|3.70[) . which we can do with a negligible error. The key identity is the following resolvent identity, proved 
in Lemma 4.2 of [T5]: For k ^ i,j we have 



Si 



s 



(k) 



Sik S k j 

s kk 



Using (|3.71|) . one readily sees that 

SakiSpaS^B^iy 3 ) - S£>S/}aS$ & 



(x s ) (a) q((y s )^)dE' 



(3.71) 



(3.72) 



Moreover, we have 



s^s 0a sith' 



(x s ) (Q) q((y s ) ia) )dE' 



— S ak Si 



bk 1 



x s q(y s ) dE' 



(a) _ 

o(^)l S 0a . (3.73) 
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Next, we claim that the conditional expectation - with respect to the variables in the a-th column of H 
of Sp a is much smaller than its typical size. To that end, we use the identities, valid for i ^ j, 



Sij — Su'^ / hikSf.j , Sij — Sjj Sjl) h kj , 



proved in [5], Lemma 6.10. Now using (|3.74l) we find 



Spa — ^ ' SggSgj hjg — m sc Spj hj a + (S a a m sc)^^ Spj hj a . 



(3.74) 



(3.75) 



The conditional expectation with respect to the variables in the a-th column of H applied to the first term 
on the right-hand side of (|3.75p vanishes; hence its contribution to the expectation of (|3.73p also vanishes. 
In order to estimate the second term on the right-hand side of (|3.73[) . we note that with high probability 

\Saa-m sc \ < N ~^ +Ce , 

by Lemma [3~5l Moreover, using the large deviation bound (3.9) in [17], we get with high probability 

1/2 



where in the last step we used (|3.71[) and Lemma 13.51 Putting everything together, we find that the 
expectation of (|3.73|) is bounded in absolute value by N~ 4 / 3+Ce . By (|3.72p . this completes the proof of 
([3"770|) . and hence of (|3TM|) . □ 



4. Extension to eigenvalues and several arguments 

In this section we describe how the arguments of Section [3] extend to general functions 9 as in p. lip . 
Consider first the case of a single eigenvalue, Xp, in which case the claim reads 

lim [E v -E w l6>(7V 2 /3( A(3 _ 7(3 ) N ) = o, (4.1) 

N— too V / 

uniformly in ft ^ tp p . Denote by g v and p w the laws of Xp in the ensembles H v and 7? w respectively. Using 
Theorem [Owe find 

E u 6(N 2 / 3 (Xp ~ 7(3 )) = J 9(N 2 / 3 (E - 7/s )) g»(dE) + 0(e~^) , (4.2) 

where u stands for either v or w, and I was defined in (|3 . 3[) . Now integration by parts yields 

[E v -E w ]e(N 2 / 3 (Xp -jp)) = -[E v -E w ] J^N 2 ' 3 6'{N 2 ' ? '{E- 1 p))l{Xp ^ E) dE + O^") , (4.3) 

where the boundary terms are of order 0(e~ v ) by Theorem 12.21 Next, we choose a smooth nondecreasing 
function rp that vanishes on the interval (— oo,/3 — 2/3] and is equal to 1 on the interval [/3 — 1/3, oo). 
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Recalling the definition (|2.15|) . we get from (|4.3j) 

[E v - E w ] 9{N 2 I 3 {\ P - 7/J )) = - [E v - E w ] J N 2 / 3 9'{N 2 ' 3 {E - 7(3 )) rp(Af(E L ,E)) dE + 0{e~^) 

= -[E v -E w ] f N 2 ' 3 8'(N 2 / 3 (E - lf3 ))r fj (Tr(l [ELiE] * 6 fl )(H)) dE + 0(ip c N~ s ) 



where in the second step we used the assumption on that r'„ is bounded, and Lemma 12.71 with fj 
jy-2/3-6e_ More precisely, we apply Lemma [2. 71 to estimate, with high probability, 

tpCtf/a J dE |Tr(l [SljB] * 6r,){H) - N{E L ,E) 

<; <p c N 2/3 J dE^N- £ +Af(E - N- 2/3 - £ ,E + N- 2/3 - £ )^j 

v ° r 

N 2 ' 3 J2 dEl(\E-X a \^ N~ 2 ' 3 ~ £ ) 

a=l ^ 7 



«S ip c N- £ + (p c ~> 

where the first step follows from (|2 . 1 8[) and the second from Theorem 12.2 
Integrating by parts again, we find with high probability 



(4.4) 



i v -E w j0(iV 2 / 3 (A / ,-7 /3 )) 

= [E v - E w ] J 6{N 2 / 3 {E - 7(3 )) r' p (Tr(l [EL>E] * 6 n )(H)) Nlmm(E + if}) dE + 0( V c N- £ ) . 

Now we may apply the Green function comparison method from Section [3. II In fact, in this case the analysis 
is easier as we have no fixed indices i and j to keep track of. 

The general case, 9 as in is treated similarly. Repeating successively the above procedure for each 

argument A^, . . . , \p k , we find that there is a constant C k , depending on fc, such that 

[E v - E w ] e(N 2 ' 3 {\ Pl - 7ft), ... , N 2 / 3 {\ Pk - lh ),Nu ai (h)u ai (ji), • • ■ . N ^ k (i k )u ak ( Jk )) 
= [E v - E w ] J k dE,--- dE k 6(N*t\Ek - 7ft ), • ■ • , N 2 ' 3 (E k - 7/3 J; Ck) 



n 



r' pi (Tr(l [Bl >El] *0-)(H))N lmm(E l + ifj) 



+ 0(tp Ck N~ £ ), (4.5) 



where we introduced the shorthand 



N 



■= - dEG kjl {E + ir))q ai [Trl 



[E L ,E~] * 



and set rj := N 2 / 3 £ ; q a is the function from Lemma 13.21 Here at each step we used the assumption on 9, 
that r'a is bounded, and the estimate 



dENImm(E + ifj) ^ Af(-oo, E + n- 2/3 - £/10 ) + N- £/1 ° < Nn sc {-2 + (p c N- 2/3 ) + 1 sC ip c , (4.6) 
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where in the first step we used (|2.19j) . in the second Theorem l2.2l and in the third the definition (|1.7p of n sc . 

The randomness on the right-hand side of (|4.5|) is expressed entirely in terms of Green functions; hence 
(|4.5[) is amenable to the Green function comparison method from Section [3. II The complications are merely 
notational, as we now have 2k fixed indices • ■ ■ ,ik>jk instead of just the two 



5. Eigenvectors in the bulk: proof of Theorem 11.101 

In this section we prove Theorem 11.101 In the bulk the eigenvalue spacing is of order ./V -1 as opposed to 
iV -2 / 3 at the edge. Thus, we shall have to take spectral windows of size r\ = N^ 1 ^ 6 . To that end, we begin 
by extending the strong local semicircle law from Theorem 12.11 to arbitrarily small values of r/ > 0. Recall 
the notation z = E + if]. 

Lemma 5.1. For any \E\ ^ 5 and < rj ^ 10, we have with high probability 



max|G.(z)-^m sc (z)| < f + ^.l) 



for large enough N . 



PROOF. By Theorem 1 2. 1[ we only need to consider 77 ^ y := ip Cl N 1 for some C\ > 0. We use the trivial 
bound 

TmGn(E + vj) < V-lmGuiE + iy) for rj^y, 
V 

as well as 

I G^- + I < ClogNmBxlmG kk {E + vi), 

k 

which follows by a simple dyadic decomposition; see [TS], Equation (4.9). Thus we get 

c 

iGijiE + it])] ^ C log N^- maxim G kk (E + iy) < y cV - . 

rj k j] l\rj 

This completes the proof. □ 

The strategy behind the proof of Theorem ll.lOl is very similar to that of Theorem [L6l given in Section [3] 
In a first step, we express the eigenvector components using integrals involving resolvent matrix elements Gij] 
in a second step, we replace the sharp indicator functions in the integrand by smoothed out functions which 
depend only on the resolvent; in a third step, we use the Green function comparison method to complete 
the proof. 

For ease of presentation, we shall give the proof for the case 9 — 9(Nu a (i)u a (j)); we show that 

lim \E V -E v ']9(Nu a (i)u a (j)) = 0, (5.2) 

where pN ^ a ^ (1 — p)N. As outlined in Section^ the extension to general functions 9, as given in (|1.15l) . 
is an easy extension which we sketch briefly at the end of this section. 
We now spell out the three steps mentioned above. 

Step 1. The analogue of Lemma T3. II in the bulk is the following result whose proof uses (|1.5[) and Lemma 
12.51 and is very similar to the proof of Lemma 13. II (in fact somewhat easier). We omit further details. 
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Lemma 5.2. Under the assumption of Theorem \1.1Q[ for any e > there exist constants C\, C 2 such that 



for r\ = N 



— 7V"-l-e 



have 



lim max max 



9{Nu a (i)u a {j))-E u < 



N 

n Jlc 



Gij (E + in) l(Aa_i sc ET ^ A a ) dE 



where 



7a- N' 1 ^) ', 7a + JV _1 (VAr) C 



E± := S±(^) Cl ?7, / a : 
and we introduce the convention Ao = —00. /fere u stands for either v or w. 

Step 2. Wc choose 77 = N^ 1 ^ 6 for some small enough e > and express the indicator function in 

E u — / G y (E + i?y) l(A a _i < E" A a ) dE 
. n J 1 

using Green functions (as before, we write I a = I). Using Theorem 12.21 wc know that 



= 0, 
(5.3) 

(5.4) 



(IST51 



N 

I" J I 



G i:j (E + in) 1 (Af(E L ,E-) = a-l)dE 



o(l) 



(5.5) 



(5.6) 



where E L := -2 - ip c N- 2 ^ 3 . 

As explained in Section 11.31 the approach in Step 2 has to be modified slightly from the one employed in 
Section [H The reason is that the size of the interval [El,E~] is no longer small, but of order one. 

For any Ei,E 2 € [—3,3] and rja > wc define /(A) = /b 1 ,b 2 ^ <£ (A) to be the characteristic function of 
[Ei,E 2 ] smoothed on scale rj d ; i.e. / = 1 on [E 1 ,E 2 ], / = 0onK\[£ 1 - n d ,E 2 + r)d] and \f'\ ^ Cij^ 1 , 
I /"| ^ Crj^ 2 . Let q = q a : K — > M + be a smooth cutoff function concentrated around a — 1, satisfying 



1 if |a;-a + l| < 1/3, g(x) = if \x - a + 1| ^ 2/3 . 



Now we choose 77^ := JV 1 rf£ , for some fixed d > 2. Then, using Lemma EH and an argument similar to the 
proof of Lemma 13.21 we find that 



E u 6» 



N 



(E)) dE 



0(1). 



(5.7) 



To simplify notation, wc follow the conventions of Section [3] in writing I = I ai q = q a and /e = fE L E-,r> d i 
and set a = i and /3 = j. In this notation, we need to estimate 



[E v - E w ] 9 



N 

- I G Q(3 (E + i77) (Tr /£(#)) dE 



(5.8) 



Now we express /e(E) in terms of Green functions using Helffer-Sjostrand functional calculus. Let x(y) be 
a smooth cutoff function with support in [—1, 1], with x(y) = 1 f° r \y\ ^ 1/2 and with bounded derivatives. 
Then we have (see e.g. Equation (B.12) of [13] ) 



i^(e)xW + ife(e)xV) - o f' E {e) X > {a) 
A — e — ia 



de dcr . 



(5.9) 
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Thus we get 
Trf E (H) 



N 
2^ 



iafUeMv) + i/ij(e)x'(0 - ^(e)x» )m(e + icr) deda 



^ (i/E(e)x'(o-) - o-/s( e )x'( ')) m ( e + icr ) dedtJ 

iiV /" /" iN f fld f 

+ da X {<j) def E (e)<jm{e + icr) + — da d 



ie f E {e) crm(e + icr) , (5.10) 



where we introduced the parameter fjd := N 1 ( d+1 ) s . We shall treat the last term of (|5.10[) as an error 
term. From (|5.1[) we find with high probability 

v ' N 

Therefore the third term of (|5.10[) is bounded, with high probability, by 



iN 

7T 



Id 



dcr / de f E {e) am(e + icr) 



(5.11) 



where in the first step we used that J \f E (e)\ de = 0{-q d 1 ). 

Step 3. We estimate (|5.8[) using a Green function comparison argument, similarly to Section l3~T1 As in 
Section [3Tl we use the notation 



J2 G ak {E + irj)G pk {E + \ri). 



Similarly to Lemma |3.4[ we begin by dropping the diagonal terms. Using Lemma 15.11 we find 

N 



-G a p{E + iri)-x{E) 



dE ^ ip c Nrf ^ N 



-l + Ce 



with high probability, so that it suffices to prove 



[E v - E w ] 9 



x(E)q(Trf E (H)) dE 



Using (|5.11|) we find that it suffices to prove 



[E v - E v 



x(E)q(y(E)+y(E))dE 



o(l) 



where 



N f 

V(E) ■= 7T / ^f E {e) X (a)m(e + ia)l{\a\ >fj d )deda, 
2tt J R 2 

y(E) := ^ f (if E (e)x'(a)-af E (e)x'(cj))m(e + ia)deda. 



(5.12) 

(5.13) 
(5.14) 

(5.15) 

(5.16) 
(5.17) 
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By a telescopic expansion similar to (|3.25p . we find that (|5.15j) follows if we can prove, with high probability, 



= ,4 + o(7V 1(a=b) - 2 ), (5.18) 

where we use the notation of Section [3. II here A is a random variable that depends on the randomness only 
through Q and the first four moments of h a b ii a =/= b, and the first two moments of h a b i£ a = b. (As in 
Section [3. 1[ E denotes expectation with respect to the product measure of the v and w ensembles.) 
Now we prove (|5.18[) . We use the resolvent expansion 

S = R- N- 1/2 RVR + N~ 1 (RV) 2 R - N- 3/2 (RV) 3 R + N- 2 (RV) 4 R - N- 5/2 (RV) 5 S. (5.19) 

Similarly to Section [3. 1[ we decompose 

Am := m s — m R = Attiq + Ami , 

where 

Am r := —^{S ii -R ii )l(r = l{ie{a,b})). 

i 

Using (|5.19[) we can expand Am r , for \<j\ ^ fjd and with high probability, 

4 

Am r {e + ia) = ^ Am< p »(e + ia) + 0(N- b/2+Ce k 2 - 2r N~ r ) , (5.20) 
P =i 

where 

|Am^| N - P /2+Cs K 2-2r N -r 

with high probability, and Ami^ is a polynomial in the matrix elements of R and V, each term containing 
precisely p matrix elements of V; here we set A CT := sup| e |^ 5 max^jlG^e + ier)!. Putting both cases r = 1, 2 
together, we get, for \a fjd\ and with high probability, 

4 

Am = Am(p) +0(N- 5 /' 2+CE (Al +N- 1 )) , |Am( p )| N~ p / 2+Ce (A* + A -1 ) . (5.22) 
p=i 

We may now estimate the variables x, y, and y. Let us first consider the variables y. From the definition of 
X, we find that in the integrand of (|5.17|) we have a ^ c and therefore by Theorem l2.1l we have A CT s; ip c A -1 / 2 
with high probability. Thus we get from (|5.17p 

4 

Ay(E) = J2 A y iP} ( E ) + °( N ~ 5/2+C£ )> l A y (p) (^)l < N~ p l 2+Ce (5.23) 
P =i 

with high probability. 



E6 



x s (E)q((y + y) s (E)) dE 



-El 



x R (E)q((y + y) R (E))dE 
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In order to estimate the contributions of the variables y, we integrate by parts, first in e and then in a, 
to obtain 



N / af E (e)x(<r)Am^(e + ia)l(\a\ >%)deda 



cn 



def E (e)fj d Am^(e + ifj d ) 



CN 



def' E (e) / dax'i^Am^ie + ia) 



CN 



def E {e) / dax(<j)Am^(e + i<j) 



(5.24) 



Using (|5.22[) . it is easy to see that the sum of the two first terms of (|5.24l) is bounded by N p/ 2 + Ce . In order 
to estimate the last term of (|5.24[) . we use (|5.22[) and (|5.1[) to get the bound 

CN f 1 da ( — — h 7 -L 5 + iV" p/2+Ce < N~p/ 2+Cs . 



Thus we find that 



Ay{E) = J2 A y (p) ( E ) + °( N ~ 5/2+C£ )' \Ay (p \E)\ ^ N~ p/2+Ce 
P =i 



(5.25) 



with high probability. 

Finally, as in p.40|) . we find that 



Ax(E) = J2 AxiP) ( E ) + °( N ~ 3/2+C£ ) 
P =i 



with high probability. Moreover, we have the bound 



|Ax (p) (£0|d£ < N~ p / 2+Ce 



(5.26) 



(5.27) 



with high probability. This concludes our estimate of the terms in the resolvent expansion of x,y, and y. 

Now using the power counting bounds from (|5.23l) . (|5.25l) . (|5.26|) . and (|5.27|) . we may easily complete the 
Green function comparison argument to prove (|5.18|) . as in Section [3.11 

Finally, we comment on how to deal with more general observables 6, as in (|1.15l) . The basic strategy is 
the same as in Section SJ In fact, the argument is simpler because the errors made in replacing sharp indicator 
functions with smooth indicator functions are easier to control in the bulk. In Section [4j the relatively large 
errors arising from the soft edge of the function 1\ El ,e] * Qfj were controlled by Lemma 12.71 In the bulk, we 
replace 1[e l ,e] with the function f E whose edges are sharper. Thus, in the bulk the error resulting from this 
replacement is bounded by N(E~ — r]d, E~ + rjd), whose integral may be estimated exactly as in (|4.4|) . The 
estimate (|4.6p is replaced by the trivial estimate Jj dE \d E Tr f E (H)\ ^ ip c . 
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